Non-volatile memory cell fabricated with slight modification to a conventional logic process and methods of operating same

ABSTRACT

A non-volatile memory cell is fabricated using a conventional logic process, with minor modifications. The cell is fabricated by forming a shallow trench isolation (STI) region in a well region of a semiconductor substrate. A recessed region is formed in the STI region, wherein the recessed region extends into the STI region and exposes a sidewall region in the well region. A capacitor region is formed in the sidewall region. A dielectric layer is formed over the well region, including the sidewall region. A gate electrode is then formed over the dielectric layer, wherein a portion of the gate electrode extends into the recessed region. An access transistor of the cell is then formed in a self-aligned manner with respect to the gate electrode. A capacitor structure is formed by the gate electrode (in the recessed region), the dielectric layer on the sidewall region, and the capacitor region.

RELATED APPLICATIONS

The present application is a continuation-in-part of U.S. patent application Ser. No. 09/415,032 filed by Fu-Chieh Hsu and Wingyu Leung on Oct. 7, 1999 now U.S. Pat. No. 6,457,108.

FIELD OF THE INVENTION

The present invention relates to non-volatile memory (NVM). More particularly, this invention relates to non-volatile memory cells fabricated by slightly modifying an application specific integrated circuit (ASIC) or conventional logic process. In the present application, a conventional logic process is defined as a semiconductor process that implements single-well or twin-well technology and uses a single layer of polysilicon. This invention further relates to a method of operating a non-volatile memory to ensure maximum data retention time.

BACKGROUND OF INVENTION

For system-on-chip (SOC) applications, it is desirable to integrate many functional blocks into a single integrated circuit. The most commonly used blocks include a microprocessor or micro-controller, static random access memory (SRAM) blocks, non-volatile memory blocks, and various special function logic blocks. However, traditional nonvolatile memory processes, which typically use stacked gate or split-gate memory cells, are not compatible with a conventional logic process. The combination of a nonvolatile memory process and a conventional logic process results in much more complicated and expensive “merged non-volatile memory and logic” process to implement system-on-chip integrated circuits. This is undesirable because the typical usage of the non-volatile memory block in an SOC application is small in relation to the overall chip size.

There are several prior art approaches to minimize the complexity of such a merged non-volatile memory and logic process. For example, U.S. Pat. No. 5,879,990 to Dormans et al. describes a process that requires at least two layers of polysilicon and two sets of transistors to implement both the normal logic transistors and the non-volatile memory transistors. This process is therefore more complex than a conventional logic process, which requires only a single layer of polysilicon.

U.S. Pat. No. 5,301,150 to Sullivan et al. describes a single poly process to implement a non-volatile memory cell. In this patent, the control gate to floating gate coupling is implemented using an n-well inversion capacitor. The control gate is therefore implemented using the n-well. An injector region must be coupled to the inversion layer in the n-well. The use of an n-well as the control gate and the need for an injector region result in a relatively large cell size.

U.S. Pat. No. 5,504,706 to D'Arrigo et al. describes a single poly process to implement a non-volatile memory cell that does not use an n-well as a control gate. FIG. 1A is a schematic diagram illustrating an array of nonvolatile memory cells C00-C11 as described by D'Arrigo et al. FIG. 1B is a cross sectional view of one of these non-volatile memory cells. As shown in FIG. 1A, each of the memory cells contains a transistor 24 having a source connected to a virtual-ground (VG) line and a drain connected to a bit line (BL). The transistor 24 further has a floating gate 40 which is coupled to a word line (WL) 86 through a coupling capacitor. The coupling capacitor includes n+ region 80 (FIG. 1B), which is located under the floating gate 40 and which is continuous with the diffusion word line 86. The capacitance of the coupling capacitor is significantly larger than the gate capacitance of the transistor to allow effective gate control of the transistor from the WL voltage levels. The n+ region 80 is formed by an additional implant to ensure good coupling during operations. This additional implant is not available in a standard logic process. The memory cells 24 are located inside a triple-well structure. More specifically, the memory cells are formed in a p-well or tank 78, which in turn, is formed in an n− tank 76, which in turn, is formed in p-well or substrate 74. A p+ contact region 88 is located in p− tank 78, and an n+ contact region 90 is located in n− tank 76. The triple-well structure allows flexibility of biasing in operating the memory cell. More specifically, the triple-well structure allows a large negative voltage (typically −9 Volts) to be applied to the word line 86 (i.e., the control gate). Both the extra n+ implant and the triple-well are not available in a conventional logic process. Similarly, U.S. Pat. No. 5,736,764 to Chang describes a p-channel metal-oxide-semiconductor (PMOS) cell having both a select gate and a control gate, wherein additional implants are required underneath the control gate.

In addition, the above-described non-volatile memory cells use a relatively thick tunneling oxide (typically 9 nanometers or more). Such a thick tunneling oxide is not compatible with conventional logic processes, because conventional logic processes provide for logic transistors having a gate oxide thickness of about 5 nm for a 0.25 micron process and 3.5 nm for a 0.18 micron process.

Conventional non-volatile memory cells typically require special high voltage transistors to generate the necessary high voltages (typically 8 Volts to 15 Volts) required to perform program and erase operations of the nonvolatile memory cells. These high voltage transistors are not available in a conventional logic process. These high voltage transistors are described, for example, in U.S. Pat. No. 5,723,355 to Chang et al.

U.S. Pat. No. 5,761,126 to Chi et al. describes a single poly electrically programmable read only memory (EPROM) cell that utilizes band-to-band tunneling in silicon to generate channel hot-electrons to be injected into a floating gate from a control gate. A relatively thin tunnel oxide can be used in this memory cell because of the enhanced electron injection. However, this memory cell only supports programming (i.e., electron injection into the floating gate). No support is provided to remove electrons from the floating gate (i.e., an erase operation is not supported).

The use of a thin gate oxide as tunneling oxide presents a challenge for achieving acceptable data retention time for non-volatile memory cells. A thin gate oxide is defined herein as a gate oxide layer having a thickness in the range of 1.5 nanometers (nm) to 6.0 nm. Although programming voltages may be reduced by the use of a thin gate oxide, the thin gate oxide will exacerbate cell disturbances. That is, the thin gate oxide will significantly increase the probability of spurious charge injection or removal from the floating gate during normal program, erase and read operations. This is due to the high electric field present in or near the thin gate oxide. As conventional logic processes scale down in geometry, the gate oxide thickness scales down proportionally. For example, a 0.25 micron process uses a 5 nm gate oxide thickness, a 0.18 micron process uses a 3.5 nm gate oxide thickness, and a 0.15 micron process uses a 3 nm gate oxide thickness. As a result, data-retention becomes a serious problem when using the standard gate oxide as the tunnel oxide in a non-volatile memory cell. U.S. Pat. No. 5,511,020 to Hu et al. describes data refreshing techniques to improve data retention time using very thin tunnel oxides.

It would therefore be desirable to implement a single-poly non-volatile memory cell using a conventional logic process, without requiring process modification and/or additional process steps.

It would also be desireable to implement a single-poly non-volatile memory cell that achieves reduced cell area with minor modifications to a conventional logic process.

It would also be desirable to have a method of operating non-volatile memory cells in conjunction with volatile memory arrays in a manner that minimizes disturbances from write, erasing and read operations, thereby improving the data retention time for the non-volatile memory cells.

SUMMARY

Accordingly, the present invention provides a non-volatile memory cell fabricated using a conventional logic process. The non-volatile memory cell uses a thin gate oxide (i.e., 1.5 nm to 6 nm) available in a conventional logic process. The non-volatile memory cell can be programmed and erased using relatively low voltages. The voltages required to program and erase can be provided by transistors readily available in a conventional logic process (i.e., transistors having a breakdown voltage in the range of 3 Volts to 7 Volts).

In one embodiment, the non-volatile memory cell includes a p-type semiconductor substrate and an n-well located in the substrate. A PMOS transistor is fabricated in the n-well. The PMOS transistor includes the thin gate oxide and an overlying polycrystalline silicon gate. An NMOS capacitor structure is fabricated in the p-type substrate. The NMOS capacitor structure includes an n-type coupling region located in the p-type substrate. The n-type coupling region is formed by the n-type source/drain implants, thereby eliminating the need for any additional implants not normally provided by the conventional logic process. The thin gate oxide and the polycrystalline silicon gate extend over the p-type substrate and the n-type coupling region, thereby forming the NMOS capacitor structure. The NMOS capacitor structure and the PMOS transistor are sized such that the NMOS capacitor structure has a capacitance larger than a capacitance of the PMOS transistor. Advantageously, a triple-well structure is not required by the present invention.

In another embodiment of the present invention, an NVM cell is fabricated by slightly modifying a conventional logic process. In this embodiment, the NVM cell is fabricated by forming a recessed PMOS coupling capacitor, which achieves significant area reduction, with minor modifications to a conventional logic process. More specifically, the NVM cell is fabricated by forming a shallow trench isolation (STI) region in an n-well region of a p-type semiconductor substrate. A photoresist mask having an opening that exposes a portion of the STI region, including an edge of the STI region, is formed. This photoresist mask is the only mask that must be added to the conventional logic process. A recessed region is formed in the STI region through the opening of the mask, wherein the recessed region extends into the STI region and exposes a sidewall region of the n-well. A n-type capacitor region is then formed in the sidewall region through the opening in the mask. The mask is stripped, and a dielectric layer is formed over the well region, including the sidewall region. One portion of the dielectric layer, which is formed over the upper surface of the n-well, forms a gate dielectric layer for the access transistor. Another portion of the dielectric layer, which is formed over the sidewall region, forms a capacitor dielectric layer for the capacitor structure. A gate electrode is formed over the dielectric layer, wherein a portion of the gate electrode extends into the recessed region. The access transistor of the NVM cell is then formed in a self-aligned manner with respect to the gate electrode. The capacitor structure is formed by the portion of the gate electrode located in the recessed region, the capacitor dielectric layer located on the sidewall region, and the capacitor region formed in the sidewall region. The capacitor structure advantageously provides a relatively high capacitance with a relatively small layout area.

The present invention incorporates a negative voltage generator that provides a negative boosted voltage having a voltage level that is less than the V_(ss) supply voltage by a voltage that is less than a diode turn-on voltage (0.7 Volts). In one embodiment, the negative boosted voltage has a value of −0.5 Volts. The negative boosted voltage is applied to the control gate of the non-volatile memory cell to enhance the electron removal operation and normal read operation without requiring a triple-well underneath the control gate.

The present invention also incorporates a positive voltage generator that provides a positive boosted voltage having a voltage level that is greater than the V_(dd) supply voltage by a voltage that is less than a diode turn-on voltage (0.7 Volts). In one embodiment, the positive boosted voltage has a value equal to V_(dd)+0.5 Volts. The positive boosted voltage is applied to the N-well of the non-volatile memory cell and the control gates of non-selected memory cells during normal read operations to suppress leakage currents through those non-selected memory cells and to improve operating margins.

In accordance with one embodiment of the present invention, non-volatile memory cells are used in a system-on-a-chip system. After power-up of a system-on-a-chip integrated circuit incorporating the embedded non-volatile memory cells, the contents of the non-volatile memory cells are read out and stored (with or without data decompression operations) into on-chip or off-chip volatile memory. The data contents of the non-volatile memory cells are then refreshed (through charge injection and removal) with optimum signal condition. The non-volatile memory cells then remain in an idle or standby mode substantially without a significant external electric field. If a reprogramming operation or a refresh operation is required, then the non-volatile memory cells are reprogrammed or refreshed as required and then returned to the idle or standby mode. As a result, the storage characteristics of the thin oxide non-volatile memory cells are improved.

The present invention will be more fully understood in view of the following description and drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1A is a schematic of an array of conventional non-volatile memory cells fabricated by a single-poly process;

FIG. 1B is a cross-sectional view of a non-volatile memory cell of FIG. 1A;

FIG. 2 is a top view of a non-volatile memory cell having a PMOS access transistor and an NMOS coupling gate in accordance with one embodiment of the present invention;

FIGS. 3A and 3B are cross-sectional views of the non-volatile memory cell of FIG. 2;

FIG. 4 is a schematic diagram of an array of the non-volatile memory cells of FIG. 2;

FIG. 5 is a table illustrating the operating modes of the array of FIG. 4 in accordance with one embodiment of the present invention;

FIG. 6 is a block diagram illustrating a system-on-a-chip in accordance with one embodiment of the present invention;

FIG. 7 is a flow diagram illustrating the operation of the system-on-a-chip of FIG. 6 in accordance with one embodiment of the present invention;

FIG. 8 is a top view of a non-volatile memory cell having a PMOS access transistor and a recessed capacitor structure in accordance with another embodiment of the present invention; and

FIGS. 9A-9D are cross sectional views of the non-volatile memory cell of FIG. 8 during various stages of fabrication.

DETAILED DESCRIPTION

FIG. 2 is a top layout view of a non-volatile memory cell 200 in accordance with one embodiment of the present invention. FIG. 3A is a cross sectional view of the non-volatile memory cell of FIG. 2 along section line A—A. FIG. 3B is a cross sectional view of the non-volatile memory cell of FIG. 2 along section line B—B. In the described example, non-volatile memory cell 200 is fabricated using a 0.25 micron conventional logic process having a typical gate oxide thickness of about 5 nm. Non-volatile memory cell 200 is operated in response to a positive V_(dd) supply voltage that has a nominal voltage of 2.5 Volts during normal operations, and a V_(ss) supply voltage of 0 Volts.

Non-volatile memory cell 200 is fabricated in a p-type monocrystalline semiconductor substrate 201 (see FIGS. 3A and 3B). In the described embodiment, substrate 201 is silicon. Non-volatile memory cell 200 includes a PMOS access transistor 210. Access transistor 210 includes p-type source region 211 and p-type drain region 212, which are formed in n-well 202. Source region 211 includes lightly doped source 211A and p+ source contact region 211B. Drain region 212 includes lightly doped drain 212A and p+ drain contact region 212B. A channel region 213 is located between source region 211 and drain region 212. Channel region 213 has a width of about 0.24 microns. Source region 211 is connected to a virtual-ground (VG) line and drain region 212 is connected to a bit line (BL). Field oxide 214 is located around the source, drain and channel regions as illustrated (FIG. 3A). Field oxide 214 is planarized, such that the upper surface of field oxide 214 and the upper surface of substrate 201 are located in the same plane. A thin gate oxide layer 215, having a thickness of about 5 nm, is located over the channel region 213. Gate oxide layer 215 has the same thickness as the gate oxide layers used in the logic transistors (not shown) fabricated in substrate 201. A conductively doped polycrystalline silicon floating gate 216 is located over thin gate oxide 215. Sidewall spacers 205-206 and 217-218, which are typically formed from silicon nitride or silicon oxide, are located at the edges of floating gate 216.

Floating gate 216 and thin gate oxide 215 extend laterally beyond access transistor 210 over p-type substrate 201 and n-type coupling region 221. N-type coupling region 221 is coupled to n+ word line 222. N-type regions 221-222, gate oxide 215 and floating gate 216 form an NMOS capacitor structure 220. NMOS capacitor structure 220 couples word line 222 to floating gate 216. N-type coupling region 221 is self-aligned with the edge of floating gate 216. This self-alignment is accomplished by implanting an n-type impurity using the edge of floating gate 216 as a mask, and then diffusing the impurity under the floating gate using an anneal step. N-type coupling region 221 is formed at the same time as the source and drain regions of NMOS logic transistors (not shown). Thus, no additional step is required to form n-type coupling region 221.

Similarly, n+ word line 222 is self-aligned with the edge of sidewall spacer 218. This self-alignment is accomplished by implanting an n-type impurity using the edge of sidewall spacer 218 as a mask, and then diffusing the impurity under the sidewall spacer using an anneal step. N+ word line 222 is formed at the same time as the n+ contact regions of NMOS logic transistors (not shown). Thus, no additional step is required to form n+ word line 222.

The total coupling capacitance of NMOS capacitor structure 220 is preferably significantly larger than the gate capacitance of the PMOS access transistor 210. In one embodiment, the coupling capacitance of NMOS capacitor structure 220 is about four times larger than the gate capacitance of PMOS access transistor 210. Non-volatile memory cell 200 can be fabricated using a conventional logic process, without any process modifications or special implants.

FIG. 4 is a schematic diagram of a 2×2 array of non-volatile memory cells 200, 300, 400 and 500. Non-volatile memory cells 300, 400 and 500 are identical to above-described non-volatile memory cell 200. Thus, non-volatile memory cells 300, 400 and 500 include PMOS access transistors 310, 410 and 510, respectively, and NMOS capacitor structures 320, 420 and 520, respectively. The sources of PMOS access transistors 210 and 410 are commonly connected to a first virtual ground line VG₀. Similarly, the sources of access transistors 310 and 510 are commonly connected to a second virtual ground line VG₁. The drains of PMOS access transistors 210 and 410 are commonly connected to a first bit line BL₀. Similarly, the drains of PMOS access transistors 310 and 510 are commonly connected to a second bit line BL₁. NMOS capacitor structures 220 and 320 are commonly connected to a first word line WL₀. Similarly, NMOS capacitor structures 420 and 520 are commonly connected to a second word line WL₁. Although the described array has two rows and two columns, it is understood that arrays having other sizes can be implemented by one of ordinary skill in the art.

FIG. 5 is a table that defines the operation of the array of FIG. 4 in accordance with one embodiment of the present invention.

The programming mode is described in connection with the programming of non-volatile memory cell 200. In the programming mode, electrons are selectively removed from the floating gate of the cell to be programmed. As a result, the PMOS threshold voltage (V_(tp)) of the programmed cell is more negative and therefore turned off during normal read operations. The programming mode is implemented by a direct tunneling (i.e., Fowler-Nordheim tunneling) mechanism through the gate oxide of the PMOS access transistor.

Non-volatile memory cell 200 is programmed as follows. Word line WL₀ is held at a voltage of 0 Volts, while bit line BL₀ and virtual ground line VG₀ are each held at a voltage of 6.5 Volts. In another embodiment, either bit line BL₀ or virtual ground line VG₀ is held at a voltage of 6.5 Volts, while the other line is left floating. N-well 202 is held at a voltage of 6.5 Volts, and p-type substrate 201 is held at a voltage of 0 Volts, thereby preventing the n-well/p-substrate junction from being forward biased. Under these bias conditions, an inversion layer is formed in the channel region of NMOS capacitor structure 220, and the floating gate 216 is coupled to a voltage slightly greater than 0 Volts. As a result, a high voltage drop exists across the gate oxide 215 of PMOS access transistor 210. An inversion layer is therefore formed in channel region 213 of PMOS access transistor 210, with the electric field exceeding 10 mega Volts per centimeter (MV/cm). Under these conditions, electrons in floating gate 216 tunnel out to the high voltage PMOS inversion layer.

In the present example, non-volatile memory cell 300 is also selected by the 0 Volt signal applied to word line WL₀. However, it is not desired to program non-volatile memory cell 300. To prevent electron removal from the floating gate of non-volatile memory cell 300, bit line BL₁ and virtual ground line VG₁ are each held at a voltage of 3.0 Volts. In another embodiment, either bit line BL₁ or virtual ground line VG₁ is held at a voltage of 3.0 Volts, and the other line is left floating. Under these conditions, the voltage drop across the gate oxide of PMOS access transistor 310 is substantially less than the voltage required for direct tunneling.

In the present programming example, a voltage of 3.0 Volts is applied to word line WL₁. As a result, non-volatile memory cells 400 and 500 are not selected for programming. Given the above-describe voltages on bit lines BL₀-BL₁ and virtual ground lines VG₀-VG₁, the 3.0 Volt signal applied to word line WL₁ ensures that the voltages across the gate oxide layers of PMOS access transistors 410 and 510 are substantially below the voltage required for direct tunneling. More specifically, because bit lines BL₀-BL₁ and virtual ground lines VG₀-VG₁ will be at either 6.5 Volts, 3.0 Volts or floating, the maximum disturb voltage will be 6.5 Volts minus 3.0 Volts, or 3.5 Volts. This maximum disturb voltage is therefore much less than the program voltage of 6.5 Volts.

In the described embodiment, the 3.0 Volt signal is generated by a positive voltage generator. This positive voltage generator provides the 3.0 Volt signal, which is greater than the 2.5 Volt positive supply voltage by 0.5 Volts. The 3.0 Volt signal is therefore greater than the 2.5 Volt signal by a magnitude less than a diode voltage drop of 0.7 Volts. A positive voltage generator capable of generating a positive boosted voltage which is greater than the positive supply voltage by a magnitude less than a diode voltage drop is described in U.S. Pat. No. 6,147,914, which is hereby incorporated by reference. This positive voltage generator is fabricated using elements that are compatible with a conventional logic process. Use of the 3.0 Volt signal advantageously improves the operating margin of memory cells 200, 300, 400 and 500.

In the erase mode, electrons are injected into the floating gates of memory cells 200, 300, 400 and 500, thereby making the threshold voltage (Vtp) of PMOS access transistors 210, 310, 410 and 510 more positive. As a result of the more positive threshold voltages, the erased PMOS access transistors are turned on during normal read operations. The erase operation implements band-to-band tunneling channel hot-electron (CHE) injection into the floating gates through Fowler-Nordheim tunneling mechanism of the PMOS access transistors. The erase operation is preferably performed in a sector mode, in which all memory cells sharing word lines and bit lines are erased together.

In the erase mode, word lines WL₀ and WL₁ are held at 0 Volts, and bit lines BL₀-BL₁ and virtual ground lines VG₀-VG₁ are held at −6.5 Volts. In another embodiment, either bit lines BL₀-BL₁ or virtual ground lines VG₀-VG₁ are held at −6.5 Volts, and the other lines are left floating. P-type substrate 201 and N-well 202 are both held at 0 Volts. Under these bias conditions, the floating gates of memory cells 200, 300, 400 and 500 are coupled to a voltage slightly less than 0 Volts. As a result, NMOS capacitor structures 220, 320, 420 and 520 and PMOS access transistors 210, 310, 410 and 510 are placed in an accumulation mode. A relatively high voltage drop exists across the p-type source/drain regions of the PMOS access transistors and the n-well 202. A relatively high voltage drop also exists between the floating gates and the p-type source/drain regions of the PMOS access transistors. The high electrical field conditions cause band-to-band tunneling to occur near the edges of the p-type source/drain regions, and the resulting channel hot-electrons (CHE) are accelerated and injected into the floating gates.

To read non-volatile memory cells 200 and 300, word line WL₀ is held at 0 Volts, virtual ground lines VG₀-VG₁ are held at 2.5 Volts (or some lower voltage level to suppress leakage current), n-well 202 is held at 3.0 Volts, and p-type substrate 201 is held at 0 Volts. Bit lines BL₀-BL₁ are pre-charged to 0 Volts (or some other voltage lower than virtual ground lines VG₀-VG₁). Under these conditions, read current will flow through the access transistors of non-programmed (erased) cells, while read current will be less through the access transistors of programmed cells.

The word line WL₁ associated with the non-selected cells is held at 3.0 Volts in the normal read mode, thereby turning off access transistors 410 and 510. Turning off access transistors 410 and 510 prevents current from flowing through these transistors into bit lines BL₀ and BL₁. As a result, cells 400 and 500 do not interfere with the bit line signals from the selected cells 200 and 300.

During the read operation, n-well 201 is biased at a voltage that is 0.5 Volts greater than the virtual ground lines VG₀-VG₁. This n-well biasing is referred to as “n-well back bias”. In a conventional logic process having a minimum feature size of 0.24 microns, the typical threshold voltage of a p-channel transistor (Vtp) is equal to −0.5 Volts. The n-well back bias raises the magnitude of the p-channel threshold voltage (to a voltage that is more negative). As a result, the sub-threshold leakage current is reduced in non-selected cells (e.g., cells 400 and 500) and selected cells that are programmed to be “off” (i.e., non-conductive during a read operation).

Similarly, the non-selected word line WL₁ is biased at 3.0 Volts, which is 0.5 Volts greater than the virtual ground lines VG₀-VG₁. This “gate reverse-bias” is also important to further reduce the sub-threshold leakage currents in the non-selected cells.

In an alternate embodiment of the present invention, the bias condition of a cell being programmed (e.g., cell 200 in the above-described example) can be modified to have a word line voltage of −0.5 Volts (instead of 0 Volts). This reduced word line voltage prevents turn on of the junction between word line 222 and p-type substrate 201. The −0.5 Volt word line bias, which is smaller in magnitude than a diode turn-on voltage, increases the maximum voltage across the gate oxide layer 215 without requiring higher voltage transistors to be used in negative voltage generator. The negative voltage generator used to generate a word line bias voltage of −0.5 Volts is described in U.S. Pat. No. 6,147,914.

In this embodiment, the bias condition of a cell that is not being programmed, but is in the same row as a cell being programmed (e.g., cell 300 in the above-described example) will also have a word line voltage of −0.5 Volts. To compensate for this lower word line voltage, the bit line and virtual ground line of the non-selected cell are reduced by 0.5 Volts, from 3.0 Volts to 2.5 Volts.

In this embodiment, the word lines of rows that do not have any cells being programmed are coupled to receive a word line bias voltage of 2.5 Volts. The associated bit lines and virtual ground lines are biased at either 2.5 Volts or 6.5 Volts, depending on whether the cells are in the same column as a cell being programmed. Note that the biasing of n-well 202 and p-type substrate 201 remain at 6.5 Volts and 0 Volts, respectively, in this embodiment.

For a conventional logic process having a minimum line size at or below 0.24 microns, the use of very thin gate oxides as tunneling oxide present major challenges for achieving acceptable data retention time for non-volatile memory cells. Although programming voltages may be reduced, the disturbance problem (i.e., spurious injection or removal of charges from the floating gate) during normal program, erase and read operations increases significantly due to the high electric field present in or near the thin tunnel oxide and the resultant tunneling leakage current and channel hot-electron injection leakage currents. As conventional logic processes scale down in geometry, the standard gate oxides also get scaled down proportionally (e.g., 5 nm and 7 nm for a 0.25 micron process, 3.5 nm, 5 nm and 7 nm for a 0.18 micron process, and 3 nm, 5 nm and 7 nm for a 0.15 micron process). As a result, data-retention becomes a serious problem when using the standard gate oxide as the tunnel oxide for the non-volatile memory cell. U.S. Pat. No. 5,511,020, which is hereby incorporated by reference in its entirety, describes data refreshing techniques to improve data retention time of non-volatile memory cells using very thin tunnel oxides. The data refreshing techniques of U.S. Pat. No. 5,511,020 can be applied, as necessary, to the non-volatile memory cells of the present invention. Note that such data refreshing techniques are optional, and are not required in order to practice the present invention.

Since both the tunneling current and the channel hot-electron injection current are highly dependent on the level of electric field present in or near the non-volatile memory cells, a method for operating non-volatile memory cells to minimize the frequency and duration of high electric field operations is described in a preferred embodiment of the present invention, thereby maximizing data retention time for non-volatile memory cells using very thin tunneling oxides.

FIG. 6 is a block diagram of a system-on-a-chip integrated circuit 600 in accordance with one embodiment of the present invention. Integrated circuit chip 600 includes processor or controller unit 601, various function blocks 602, non-volatile memory block 603 and on-chip volatile memory block 604. In another embodiment, on-chip volatile memory block 604 can be replaced with off-chip volatile memory chips 605. In one embodiment, function blocks 602 include at least one programmable logic block that uses volatile memory elements as control and configuration bits. At least a portion of these control and configuration bits are stored in non-volatile memory block 603. During initialization, these control and configuration bits are loaded into volatile memory block 604, thereby enabling normal operations within function blocks 602. To reduce the disturbances originated from the program, erase and read modes, the non-volatile memory cells in on-chip non-volatile memory block 603 are operated in accordance with the flow chart 700 provided in FIG. 7.

As illustrated in FIG. 7, the system-on-a-chip integrated circuit is powered-up and/or initialized during Step 701. The contents of non-volatile memory array 603 are then read during Step 702. In one embodiment, the read operation performed during Step 702 includes adaptive algorithms that sample the data content of a selected cell or cells in non-volatile memory array 603 to determine the actual threshold voltage levels for the programmed and non-programmed non-volatile memory cells. The optimum voltages for reading out the contents of the non-volatile memory cells are then selected in view of the actual threshold voltage levels. For example, if the actual threshold voltage levels are relatively low, then a lower read voltage is used. The data content stored in non-volatile memory array 603 may be compressed to reduce the capacity requirement of the non-volatile memory array 603 on chip 600. Data integrity may be further enhanced by utilizing error detection and correction (EDC) techniques during the read operation.

During Step 703, the contents read from non-volatile memory block 603 are stored in volatile memory block 604 (or optionally in off-chip volatile memory chips 605). The non-volatile memory block 603 is then controlled to enter the program and erase modes (Step 704). Program and erase operations are then performed to non-volatile memory block 603, such that the original contents of non-volatile memory block 603 are restored/reconditioned from volatile memory block 604 (or volatile memory chips 605) (Step 705). Non-volatile memory block 603 then enters the standby mode (Step 706). During the standby mode, minimal or no external biases are applied to the non-volatile memory cells in non-volatile memory block 603. Preferably, the entire non-volatile memory block 603 is powered down to 0 Volts to prevent power supply glitches or abrupt power outages from causing disturbances to the non-volatile memory cells.

As long as no interrupt is received, non-volatile memory block 603 remains in the standby mode (Steps 706 and 707). However, if an interrupt is received, then this interrupt is processed. If the interrupt indicates a power down sequence (as determined in Step 707A), then the chip 600 is powered down (Step 708). If the interrupt indicates a new program request (as determined in Step 709), then processing returns to Step 704.

As described above, refresh operations may be required in view of the thin gate oxide used in the non-volatile memory cells. Refresh of the non-volatile memory cells may be required a few times a day, once every few days or once every few weeks, depending on the particular characteristics of the cells in non-volatile memory block 603. A refresh management system, such as the one described in U.S. Pat. No. 5,511,020, is used to control the refresh operations. In one embodiment, an energy storage means, such as a battery or a capacitor, is used to supply energy for a refresh operation to the non-volatile memory if main power to the system-on-a-chip is off.

The Restore/Recondition operation of Steps 704-705 can be conditional based on whether a preset criterion for charge loss is met. In this case, optional Steps 710 and 711 are added as illustrated. Step 711 is added between Steps 703 and 704. In Step 711, it is determined whether a refresh operation is required in non-volatile memory block 603. If no refresh is required, then Steps 704 and 705 are bypassed (i.e., non-volatile memory block 603 is not restored or refreshed), and processing proceeds to the standby mode in Step 706. If a refresh operation is required in Step 711, the processing proceeds to Steps 704-705, where a refresh operation is performed.

Step 710 is an additional interrupt that indicates that non-volatile memory 603 must be refreshed. This interrupt is processed by returning processing from Step 710 to Step 704, thereby refreshing non-volatile memory 603. Because Steps 710 and 711 are optional steps, these steps are shown in dashed lines in FIG. 7.

Using the above-described steps, the disturbances from program, erase and read modes can be precisely managed and predicted to achieve maximum data-retention time and data integrity in non-volatile memory block 603. It is noted that the system operating method of the preferred embodiment described above is applicable to conventional non-volatile memory cells including stacked-gate cells, split-gate cells, nitride-oxide (metal-nitride-oxide-semiconductor (MNOS) or silicon-nitride-oxide-semiconductor (SNOS) cells, oxidized-nitride-oxide (metal-oxide-nitride-oxide-semiconductor (MONOS) or silicon-oxide-nitride-oxide-semiconductor (SONOS)) cells and their variations.

Even in the standby mode or during storage conditions (i.e., when zero or no (floating) electrical biases are applied to the non-volatile memory cells), there are internal electric fields present in the non-volatile memory cells that can cause charge loss and data retention problems. To optimize data retention time during these conditions, the internal electric fields must be minimized as well. This is accomplished in the present invention by setting the threshold voltages (Vtp) for both the programmed and erased charge states to be balanced against the internal potential levels of the silicon substrate 201 and polysilicon gate electrodes, taking into consideration the flat-band voltage levels for both the NMOS capacitor structure and the PMOS access transistor in the non-volatile memory cell. In one embodiment of the present invention, the threshold voltages of the PMOS access transistors are set equal to −0.5 Volts when the non-volatile memory cell is erased, and −1.0 Volt when the non-volatile memory cell is programmed. The difference between these threshold voltages is 0.5 Volts. Similarly, the threshold voltages of the NMOS capacitor structures are set to be equal to 0.5 Volts when the non-volatile memory cell is erased, and 0 Volts when the nonvolatile memory cell is programmed. Again, the difference between these threshold voltages is 0.5 Volts.

FIG. 8 is a top view of a non-volatile memory cell 800 having a PMOS access transistor and a recessed capacitor structure in accordance with another embodiment of the present invention. FIGS. 9A-9D are cross sectional views of NVM cell 800 along section line C—C during various stages of fabrication. NVM cell 800 is operated in substantially the same manner as NVM cell 200 (FIG. 2) and the biases (except for the p-sub bias) in FIG. 5 may be shifted up or down together optionally for easier circuit implementation without affecting each operational mode.

In general, NVM cell 800 includes a PMOS access transistor 801 and a PMOS capacitor structure 802 formed inside a recessed region 803 within a shallow trench isolation (STI) region. The PMOS capacitor structure 802 replaces the NMOS capacitor structure located outside n-well 202 in NVM cell 200 (FIG. 2). The recessed region 803 is formed by etching the STI region using one extra masking step to expose the silicon sidewall of the STI region. The gate dielectric and gate polysilicon are subsequently formed, thereby creating a relatively large coupling capacitor using a relatively small area inside the same n-well where the PMOS access transistor is fabricated. Consequently, the resulting NVM cell 800 is smaller than NVM cell 200 (FIG. 2).

Access transistor 801 includes salicide gate electrode 810, p-type source region 811 and p-type drain region 812. An n-type channel region 813 is located between source region 811 and drain region 812. Channel region 813 has a width of about 0.24 microns. Source region 811 is connected to the virtual-ground (VG) line, drain region 812 is connected to the bit line (BL), and the recessed coupling capacitor structure 802, which includes capacitor regions 814-815, is connected to a metal word line (WL). The recessed region 803 partially overlaps the gate electrode 810 and the p-type coupling capacitor electrode.

FIGS. 9A-9D are cross sectional views of NVM cell 800 during various stages of fabrication. As illustrated in FIG. 9A, an n-type well region 902 is formed in a p-type monocrystalline silicon substrate 901 in accordance with a conventional logic process. A shallow trench isolation (STI) region 903 is then formed in n-well region 902, also in accordance with a conventional logic process. A photoresist mask 910, having an opening 911, is formed over the resulting structure as illustrated. Opening 911 is located to expose an edge of STI region 903. Photoresist mask 910 is an extra mask, which is not typically used in a conventional logic process.

An etch is performed through opening 911 of photoresist mask 910, thereby creating recessed region 803 in STI region 903. Recessed region 803 exposes a silicon sidewall region 913 of n-well 902. The bottom of recessed region 803 is at least 100 nm below the upper surface of n-well 902. The bottom of recessed region 803 is preferably at least 50 nm above the bottom of STI region 903.

A p-type ion implant step is performed through opening 911 of mask 910, thereby forming p-type capacitor region 814. P-type capacitor region 814 adjusts the threshold voltage of sidewall region 913, and improves the coupling capacitance of the subsequently formed sidewall capacitor structure.

As illustrated in FIG. 9B, photoresist mask 910 is then stripped, and the conventional logic process is resumed to form a dielectric layer that includes gate dielectric layer 905A and capacitor dielectric layer 905B. In the described embodiment, this dielectric layer has an effective thickness between 3 nm and 8 nm. In one embodiment, gate dielectric layer 905A and capacitor dielectric layer 905B are formed of silicon oxide, although this is not necessary. A polycrystalline silicon layer 906 is the deposited over gate dielectric layer 905A and capacitor dielectric layer 905B to a thickness of about 150 nm. Polysilicon layer 906 fills the recessed region 803 with minimal resulting topography. A photoresist mask 920 is formed over the polysilicon layer 906.

As illustrated in FIG. 9C, polysilicon layer 906 is etched through photoresist mask 920, thereby leaving polysilicon gate electrode 906A. Polysilicon gate electrode 906A exhibits a relatively large area of contact with gate dielectric layer 905B along sidewall region 913, within a relatively small layout area.

As illustrated in FIG. 9D, the conventional logic process is then followed to form the LDD p-type drain region 907, sidewall spacers 921-922, P++ type drain region 908, P++ type capacitor region 815, and metal salicide regions 931-933. Drain regions 907-908 combine to form drain region 812. Polysilicon gate electrode 906A and metal salicide region 932 combine to form salicide gate electrode 810. P-type capacitor region 814 and P++ type region 815 combine to form a p-type coupling capacitor electrode 940. Note that capacitor structure 802 is formed by polysilicon gate electrode 906A, capacitor dielectric layer 905B and p-type capacitor electrode 940.

The right edge of the patterned polysilicon gate electrode 906A is relatively close to sidewall region 913, thereby ensuring that sidewall spacer 922 covers and protects sidewall channel region 913 and capacitor dielectric layer 905B from possible damage or shorting during the salicidation process.

The capacitor structure 802 illustrated in FIG. 9D advantageously exhibits a relatively large plate area (and therefore a relatively high capacitance) in a relatively small layout area because the capacitor is formed along sidewall 913 of the recessed region 803. PMOS capacitor structure 802 exhibits a capacitance that is larger then the capacitance of PMOS transistor 801.

Although the present invention has been described in connection with several embodiments, it is understood that this invention is not limited to the embodiments disclosed, but is capable of various modifications, which would be apparent to one of ordinary skill in the art. Thus, the invention is limited only by the following claims. 

1. A non-volatile memory system comprising: a semiconductor region having a first conductivity type; an access transistor located in the semiconductor region, the access transistor having a gate electrode located over the semiconductor region; a shallow trench isolation (STI) region located in the semiconductor region, wherein a recessed region extends into the STI region and exposes a sidewall region of the semiconductor region; and a capacitor structure that includes a capacitor dielectric layer located over the sidewall region of the semiconductor region, a portion of the gate electrode, which extends into the recessed region, and a capacitor region located in the sidewall region of the semiconductor region.
 2. The non-volatile memory system of claim 1, wherein the recessed region extends at least 100 nanometers below the upper surface of the semiconductor region.
 3. The non-volatile memory cell of claim 1, wherein the STI region extends at least 50 nanometers below the recessed region.
 4. The non-volatile memory system of claim 1, further comprising a dielectric spacer formed adjacent to an edge of the gate electrode, wherein the dielectric spacer covers the capacitor dielectric layer and the sidewall region of the semiconductor region.
 5. The non-volatile memory system of claim 1, wherein the access transistor further comprises two source/drain regions having a second conductivity type, opposite the first conductivity type.
 6. The non-volatile memory system of claim 1, wherein the semiconductor region comprises an n-well located in a p-type semiconductor substrate, and the access transistor is a PMOS transistor.
 7. The non-volatile memory system of claim 1, wherein the capacitor region is a p-type region.
 8. The non-volatile memory system of claim 1, further comprising circuitry for removing electrons from the gate by tunneling.
 9. The non-volatile memory system of claim 1, further comprising circuitry for injecting electrons into the gate electrode using tunneling.
 10. The non-volatile memory system of claim 1, further comprising circuitry for refreshing charge stored by the gate electrode.
 11. The non-volatile memory system of claim 1, wherein the access transistor has a gate dielectric layer having the same thickness as the capacitor dielectric layer.
 12. The non-volatile memory system of claim 1, wherein the access transistor has a gate dielectric layer having a different thickness than the capacitor dielectric layer. 